Nonparametric Evaluation of Quantitative Traits in Population-Based Association Studies when the Genetic Model is Unknown
Abstract
The statistical association between a single nucleotide polymorphism (SNP) genotype and a quantitative trait in genome-wide association studies is usually assessed using a linear regression model or, for non-normally distributed trait values, the Kruskal-Wallis test. While linear regression models assume an additive mode of inheritance via equidistant genotype scores, the Kruskal-Wallis test merely tests for global differences in trait values among the three genotype groups. Both approaches thus exhibit suboptimal power when the underlying mode of inheritance is dominant or recessive. Furthermore, these tests do not perform well in the common situations where only a few trait values are available in a rare genotype category (imbalance), or where the values associated with the three genotype categories exhibit unequal variances (variance heterogeneity). We propose a maximum test based on a Marcus-type multiple contrast test for relative effect sizes. This test allows model-specific testing of a dominant, additive, or recessive mode of inheritance, and it is robust against variance heterogeneity. We show how to obtain mode-specific simultaneous confidence intervals for the relative effect sizes to aid in interpreting the biological relevance of the results. Further, we discuss the use of a related all-pairwise comparisons contrast test with range-preserving confidence intervals as an alternative to the Kruskal-Wallis heterogeneity test. We applied the proposed maximum test to the Bogalusa Heart Study dataset and gained a remarkable increase in the power to detect association, particularly for rare genotypes. Our simulation study also demonstrated that, in contrast to the standard parametric approaches, the proposed nonparametric tests control the family-wise error rate in the presence of non-normality and variance heterogeneity.
We provide a publicly available R package, nparcomp, that can be used to compute the simultaneous confidence intervals and the compatible multiplicity-adjusted p-values associated with the proposed maximum test.
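To make the idea of mode-specific contrasts of relative effects more concrete, the following is a minimal Python sketch, not the authors' implementation and not the nparcomp interface. It assumes unweighted relative effects (each genotype group compared against the equally weighted mean of the three group distributions) and equal-weight Marcus-type contrast vectors for three groups ordered by the number of risk alleles; a sample-size-weighted variant is also conceivable. All function and variable names are hypothetical. The sketch produces point estimates only; the maximum test, adjusted p-values, and simultaneous confidence intervals additionally require a covariance estimate, which is omitted here.

```python
def pairwise_relative_effect(x, y):
    """Estimate P(X < Y) + 0.5 * P(X = Y) by comparing all pairs of observations."""
    score = 0.0
    for xi in x:
        for yj in y:
            if xi < yj:
                score += 1.0
            elif xi == yj:
                score += 0.5
    return score / (len(x) * len(y))


def relative_effects(groups):
    """Unweighted relative effect of each group against the mean distribution.

    Assumption: the reference distribution is the equally weighted average
    of the group distributions (a group compared with itself contributes 0.5).
    """
    a = len(groups)
    return [
        sum(pairwise_relative_effect(other, g) for other in groups) / a
        for g in groups
    ]


# Assumed equal-weight Marcus-type contrasts for genotype groups ordered
# by risk-allele count (0, 1, 2). Names and weights are illustrative.
CONTRASTS = {
    "dominant": (-1.0, 0.5, 0.5),    # group 0 vs. pooled groups 1 and 2
    "additive": (-1.0, 0.0, 1.0),    # group 0 vs. group 2
    "recessive": (-0.5, -0.5, 1.0),  # pooled groups 0 and 1 vs. group 2
}


def contrast_estimates(groups):
    """Apply each mode-specific contrast vector to the relative effects."""
    p = relative_effects(groups)
    return {
        mode: sum(c * pi for c, pi in zip(weights, p))
        for mode, weights in CONTRASTS.items()
    }


if __name__ == "__main__":
    # Toy trait values for the three genotype groups (made-up numbers).
    toy_groups = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    for mode, estimate in contrast_estimates(toy_groups).items():
        print(mode, round(estimate, 4))
```

A contrast estimate near zero indicates no shift under the corresponding inheritance mode, while the contrast with the largest standardized value would point to the best-fitting mode once variances are taken into account.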
Similar resources
Marginal Analysis of A Population-Based Genetic Association Study of Quantitative Traits with Incomplete Longitudinal Data
A common study to investigate gene-environment interaction is designed to be longitudinal and population-based. Data arising from longitudinal association studies often contain missing responses. Naive analysis without taking missingness into account may produce invalid inference, especially when the missing data mechanism depends on the response process. To address this issue in the ana...
The Pattern of Linkage Disequilibrium in Livestock Genome
Linkage disequilibrium (LD) is the basis of genomic selection, genomic marker imputation, marker-assisted selection (MAS), quantitative trait loci (QTL) mapping, parentage testing and whole-genome association studies. Particular alleles at nearby loci tend to be co-inherited. At linked loci, this pattern leads to an association between alleles in the population, which is known as LD. Two metr...
Posterior Computation for Hierarchical Dirichlet Process Mixture Models: Application to Genetic Association Studies of Quantitative Traits in the Presence of Population Stratification
In ?, we introduced a unified hierarchical Bayesian semiparametric model for genetic association studies of quantitative traits in the presence of population stratification. The model uses a Dirichlet Process Mixture (DPM) construction to account for stratification in making association inference. It also involves a nonparametric sparsity prior to accommodate the expectation that most genetic m...
Association analysis for traits associated with powdery mildew tolerance in barley [Hordeum vulgare L.] using AFLP markers
Association analysis is a useful method for evaluating significant associations between molecular markers and trait phenotypes. This study was performed to evaluate the association between traits related to powdery mildew resistance and molecular markers. The investigation used 77 barley genotypes and AFLP markers. In phenotypic evaluation, reaction of seedlings to powdery mild...
Detecting Genetic Interactions for Quantitative Traits Using m-Spacing Entropy Measure
A number of statistical methods for detecting gene-gene interactions have been developed in genetic association studies with binary traits. However, many phenotype measures are intrinsically quantitative and categorizing continuous traits may not always be straightforward and meaningful. Association of gene-gene interactions with an observed distribution of such phenotypes needs to be investiga...
Association Analysis for Important Quantitative and Morphological Traits in Cultivars and Advanced Lines of Soybean (Glycine max (L.)) using Microsatellite Markers
Extended Abstract Introduction and Objective: The economic value of a genotype depends on its various traits; therefore, accurate knowledge of genetic behavior and identification of the genomic loci involved in controlling these traits can help the breeder to improve genotypes. Material and Methods: In this study, the relationship between microsatellite markers and some important agrono...